Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 3392 |
| Missing cells | 13 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 665.8 KiB |
| Average record size in memory | 201.0 B |
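The derived figures in the table above follow from the raw counts. A minimal sketch in plain Python (the variable names are illustrative, not taken from the profiling tool) reproduces the missing-cell share and the average record size, assuming 1 KiB = 1024 bytes:

```python
# Values copied from the "Dataset statistics" table.
n_variables = 25
n_observations = 3392
missing_cells = 13
total_size_kib = 665.8

# Share of missing cells among all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average bytes per row; 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.3f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

The missing-cell share comes out well below 0.1%, which is why the report shows it as "< 0.1%".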
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 15 |
| Boolean | 1 |
| DateTime | 1 |
p_throws has constant value "L" | Constant |
pitcher_id has constant value "477132" | Constant |
ab_id is highly correlated with g_id | High correlation |
g_id is highly correlated with ab_id | High correlation |
inning is highly correlated with p_score and 1 other fields | High correlation |
o is highly correlated with outs | High correlation |
p_score is highly correlated with inning and 1 other fields | High correlation |
event_num is highly correlated with inning and 1 other fields | High correlation |
b_count is highly correlated with pitch_num | High correlation |
s_count is highly correlated with pitch_num | High correlation |
outs is highly correlated with o | High correlation |
pitch_num is highly correlated with b_count and 1 other fields | High correlation |
ab_id is highly correlated with g_id | High correlation |
g_id is highly correlated with ab_id | High correlation |
inning is highly correlated with event_num | High correlation |
o is highly correlated with outs | High correlation |
p_score is highly correlated with event_num | High correlation |
event_num is highly correlated with inning and 1 other fields | High correlation |
b_count is highly correlated with pitch_num | High correlation |
s_count is highly correlated with pitch_num | High correlation |
outs is highly correlated with o | High correlation |
pitch_num is highly correlated with b_count and 1 other fields | High correlation |
code is highly correlated with type and 2 other fields | High correlation |
type is highly correlated with code and 2 other fields | High correlation |
b_count is highly correlated with pitcher_id and 1 other fields | High correlation |
s_count is highly correlated with pitcher_id and 1 other fields | High correlation |
on_3b is highly correlated with pitcher_id and 1 other fields | High correlation |
o is highly correlated with pitcher_id and 2 other fields | High correlation |
pcodes is highly correlated with pitch_type and 2 other fields | High correlation |
on_1b is highly correlated with pitcher_id and 1 other fields | High correlation |
event is highly correlated with pitcher_id and 1 other fields | High correlation |
on_2b is highly correlated with pitcher_id and 1 other fields | High correlation |
pitch_type is highly correlated with pcodes and 2 other fields | High correlation |
pitcher_id is highly correlated with code and 14 other fields | High correlation |
top is highly correlated with pitcher_id and 1 other fields | High correlation |
outs is highly correlated with o and 2 other fields | High correlation |
p_throws is highly correlated with code and 14 other fields | High correlation |
stand is highly correlated with pitcher_id and 1 other fields | High correlation |
ab_id is highly correlated with g_id and 3 other fields | High correlation |
batter_id is highly correlated with date | High correlation |
event is highly correlated with o and 4 other fields | High correlation |
g_id is highly correlated with ab_id and 3 other fields | High correlation |
inning is highly correlated with p_score and 1 other fields | High correlation |
o is highly correlated with event and 1 other fields | High correlation |
p_score is highly correlated with ab_id and 4 other fields | High correlation |
top is highly correlated with date | High correlation |
date is highly correlated with ab_id and 6 other fields | High correlation |
code is highly correlated with event and 2 other fields | High correlation |
type is highly correlated with event and 1 other fields | High correlation |
pitch_type is highly correlated with event and 3 other fields | High correlation |
event_num is highly correlated with inning and 1 other fields | High correlation |
b_score is highly correlated with ab_id and 2 other fields | High correlation |
b_count is highly correlated with pitch_num | High correlation |
s_count is highly correlated with pitch_type and 1 other fields | High correlation |
outs is highly correlated with o | High correlation |
pitch_num is highly correlated with b_count and 1 other fields | High correlation |
pcodes is highly correlated with pitch_type | High correlation |
p_score has 1318 (38.9%) zeros | Zeros |
b_score has 1862 (54.9%) zeros | Zeros |
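The "Constant" and "Zeros" alerts above can be checked from counts that appear elsewhere in this report. The sketch below is only an illustration in plain Python; the dictionary names are hypothetical, and the rule the profiler uses to decide when a zero share is worth flagging is not shown in this report, so no threshold is assumed here.

```python
# Values copied from the report: 3392 rows, plus distinct and zero counts per column.
n_observations = 3392
distinct_counts = {"p_throws": 1, "pitcher_id": 1, "stand": 2}
zero_counts = {"p_score": 1318, "b_score": 1862}

# A column with a single distinct value is what the report labels "Constant".
constant_columns = [name for name, n in distinct_counts.items() if n == 1]

# Share of rows equal to zero, formatted like the "Zeros" alerts.
zero_share = {
    name: f"{100 * count / n_observations:.1f}%"
    for name, count in zero_counts.items()
}

print(constant_columns)
print(zero_share)
```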
Reproduction
| Analysis started | 2021-11-06 22:51:04.540975 |
|---|---|
| Analysis finished | 2021-11-06 22:51:13.886532 |
| Duration | 9.35 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
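The reported duration is the difference between the two timestamps. A small sketch using only the standard library confirms the figure:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2021-11-06 22:51:04.540975")
finished = datetime.fromisoformat("2021-11-06 22:51:13.886532")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```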
| Distinct | 897 |
|---|---|
| Distinct (%) | 26.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2015091774 |
| Minimum | 2015000768 |
|---|---|
| Maximum | 2015183800 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.0 KiB |
Quantile statistics
| Minimum | 2015000768 |
|---|---|
| 5-th percentile | 2015005032 |
| Q1 | 2015045807 |
| median | 2015090365 |
| Q3 | 2015139176 |
| 95-th percentile | 2015172599 |
| Maximum | 2015183800 |
| Range | 183032 |
| Interquartile range (IQR) | 93369 |
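The last two rows of the quantile table are derived from the others: the range is the maximum minus the minimum, and the IQR is Q3 minus Q1. A short check in Python, with the values copied from the table:

```python
# Quantiles copied from the table above.
minimum = 2015000768
q1 = 2015045807
q3 = 2015139176
maximum = 2015183800

value_range = maximum - minimum  # spread of the full data
iqr = q3 - q1                    # spread of the middle 50%

print(value_range, iqr)
```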
Descriptive statistics
| Standard deviation | 53785.52681 |
|---|---|
| Coefficient of variation (CV) | 2.669135347 × 10⁻⁵ |
| Kurtosis | -1.195201581 |
| Mean | 2015091774 |
| Median Absolute Deviation (MAD) | 48781 |
| Skewness | -0.02087817978 |
| Sum | 6.835191297 × 10¹² |
| Variance | 2892882894 |
| Monotonicity | Increasing |
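Several of the descriptive statistics above are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below checks these with the displayed (rounded) values, so agreement is only expected up to rounding:

```python
import math

# Values copied from the report (the mean is shown rounded to an integer).
n_observations = 3392
mean = 2015091774
std = 53785.52681

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
total = mean * n_observations    # sum of all values

print(f"CV       = {cv:.9e}")
print(f"variance = {variance:.0f}")
print(f"sum      = {total:.9e}")

# Compare with the figures printed in the table, allowing for rounding.
print(math.isclose(cv, 2.669135347e-5, rel_tol=1e-7))
print(math.isclose(variance, 2892882894, rel_tol=1e-8))
print(math.isclose(total, 6.835191297e12, rel_tol=1e-8))
```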
| Value | Count | Frequency (%) |
| 2015022708 | 10 | 0.3% |
| 2015162193 | 10 | 0.3% |
| 2015183799 | 10 | 0.3% |
| 2015172572 | 9 | 0.3% |
| 2015028322 | 9 | 0.3% |
| 2015168079 | 9 | 0.3% |
| 2015116076 | 9 | 0.3% |
| 2015057261 | 9 | 0.3% |
| 2015011143 | 9 | 0.3% |
| 2015016262 | 9 | 0.3% |
| Other values (887) | 3299 |
| Value | Count | Frequency (%) |
| 2015000768 | 3 | |
| 2015000769 | 5 | |
| 2015000770 | 5 | |
| 2015000771 | 3 | |
| 2015000772 | 3 | |
| 2015000777 | 1 | < 0.1% |
| 2015000778 | 1 | < 0.1% |
| 2015000779 | 5 | |
| 2015000783 | 3 | |
| 2015000784 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 2015183800 | 5 | |
| 2015183799 | 10 | |
| 2015183798 | 8 | |
| 2015183793 | 3 | 0.1% |
| 2015183792 | 3 | 0.1% |
| 2015183791 | 3 | 0.1% |
| 2015183785 | 5 | |
| 2015183784 | 5 | |
| 2015183783 | 4 | 0.1% |
| 2015183778 | 2 | 0.1% |
| Distinct | 221 |
|---|---|
| Distinct (%) | 6.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 496396.7556 |
| Minimum | 112526 |
|---|---|
| Maximum | 630111 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.0 KiB |
Quantile statistics
| Minimum | 112526 |
|---|---|
| 5-th percentile | 424325 |
| Q1 | 453211 |
| median | 493114 |
| Q3 | 543939 |
| 95-th percentile | 607054 |
| Maximum | 630111 |
| Range | 517585 |
| Interquartile range (IQR) | 90728 |
Descriptive statistics
| Standard deviation | 75201.10871 |
|---|---|
| Coefficient of variation (CV) | 0.1514939569 |
| Kurtosis | 6.323821593 |
| Mean | 496396.7556 |
| Median Absolute Deviation (MAD) | 46633 |
| Skewness | -1.408403215 |
| Sum | 1683777795 |
| Variance | 5655206752 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 518934 | 69 | 2.0% |
| 434636 | 63 | 1.9% |
| 457763 | 59 | 1.7% |
| 501647 | 59 | 1.7% |
| 453568 | 58 | 1.7% |
| 571448 | 55 | 1.6% |
| 622110 | 54 | 1.6% |
| 493114 | 51 | 1.5% |
| 474832 | 48 | 1.4% |
| 460026 | 46 | 1.4% |
| Other values (211) | 2830 |
| Value | Count | Frequency (%) |
| 112526 | 5 | 0.1% |
| 133380 | 33 | |
| 150029 | 13 | 0.4% |
| 150212 | 3 | 0.1% |
| 346798 | 2 | 0.1% |
| 400085 | 12 | 0.4% |
| 405395 | 31 | |
| 407781 | 31 | |
| 407812 | 11 | 0.3% |
| 407822 | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 630111 | 6 | 0.2% |
| 628356 | 15 | 0.4% |
| 628333 | 6 | 0.2% |
| 623143 | 6 | 0.2% |
| 622110 | 54 | |
| 621043 | 10 | 0.3% |
| 608700 | 8 | 0.2% |
| 608671 | 3 | 0.1% |
| 608596 | 8 | 0.2% |
| 608365 | 23 |
| Distinct | 22 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| Strikeout | |
|---|---|
| Groundout | |
| Single | |
| Walk | |
| Flyout | |
| Other values (17) |
Length
| Max length | 19 |
|---|---|
| Median length | 9 |
| Mean length | 7.958726415 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Hit By Pitch |
|---|---|
| 2nd row | Hit By Pitch |
| 3rd row | Hit By Pitch |
| 4th row | Strikeout |
| 5th row | Strikeout |
Common Values
| Value | Count | Frequency (%) |
| Strikeout | 1390 | 41.0% |
| Groundout | 583 | 17.2% |
| Single | 373 | 11.0% |
| Walk | 236 | 7.0% |
| Flyout | 200 | 5.9% |
| Lineout | 175 | 5.2% |
| Pop Out | 117 | 3.4% |
| Double | 90 | 2.7% |
| Forceout | 65 | 1.9% |
| Home Run | 53 | 1.6% |
| Other values (12) | 110 | 3.2% |
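Each frequency in this table is the count divided by the 3392 observations, rounded to one decimal place. A minimal sketch (the helper name `frequency_pct` is illustrative) reproduces the percentages for the five most common events:

```python
# Counts copied from the "Common Values" table; 3392 rows in total.
n_observations = 3392
event_counts = {
    "Strikeout": 1390,
    "Groundout": 583,
    "Single": 373,
    "Walk": 236,
    "Flyout": 200,
}

def frequency_pct(count, total):
    """Format a count as a percentage of the total with one decimal."""
    return f"{100 * count / total:.1f}%"

for event, count in event_counts.items():
    print(f"{event:<10} {count:>5} {frequency_pct(count, n_observations):>6}")
```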
Words
| Value | Count | Frequency (%) |
| strikeout | 1398 | 37.6% |
| groundout | 595 | 16.0% |
| single | 373 | 10.0% |
| walk | 240 | 6.4% |
| flyout | 200 | 5.4% |
| lineout | 175 | 4.7% |
| out | 138 | 3.7% |
| pop | 121 | 3.3% |
| double | 92 | 2.5% |
| forceout | 65 | 1.7% |
| Other values (19) | 324 | 8.7% |
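The lowercase table above appears to count word tokens rather than whole labels, which is why "pop" and "out" show up as separate entries. The sketch below assumes lowercasing and whitespace splitting, which is an assumption about how the table was built, and uses only three labels from the "Common Values" table as input:

```python
from collections import Counter

# A subset of labels and counts copied from the "Common Values" table.
event_counts = {"Strikeout": 1390, "Pop Out": 117, "Home Run": 53}

# Split each label into lowercase words and weight each word by the label count.
word_counts = Counter()
for label, count in event_counts.items():
    for word in label.lower().split():
        word_counts[word] += count

for word, count in word_counts.most_common():
    print(word, count)
```

The totals in the report are higher than this subset gives (for example 1398 for "strikeout" and 138 for "out"), presumably because other labels that are not listed individually contain the same words.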
| Distinct | 33 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 201501214.7 |
| Minimum | 201500012 |
|---|---|
| Maximum | 201502425 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.0 KiB |
Quantile statistics
| Minimum | 201500012 |
|---|---|
| 5-th percentile | 201500067 |
| Q1 | 201500606 |
| median | 201501198 |
| Q3 | 201501842 |
| 95-th percentile | 201502276 |
| Maximum | 201502425 |
| Range | 2413 |
| Interquartile range (IQR) | 1236 |
Descriptive statistics
| Standard deviation | 710.1755483 |
|---|---|
| Coefficient of variation (CV) | 3.524423162 × 10⁻⁶ |
| Kurtosis | -1.199796231 |
| Mean | 201501214.7 |
| Median Absolute Deviation (MAD) | 644 |
| Skewness | -0.03006276692 |
| Sum | 6.834921203 × 10¹¹ |
| Variance | 504349.3095 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 201501987 | 132 | 3.9% |
| 201501271 | 123 | 3.6% |
| 201500907 | 117 | 3.4% |
| 201501772 | 116 | 3.4% |
| 201501540 | 115 | 3.4% |
| 201501842 | 111 | 3.3% |
| 201500525 | 110 | 3.2% |
| 201500460 | 110 | 3.2% |
| 201501906 | 108 | 3.2% |
| 201500987 | 107 | 3.2% |
| Other values (23) | 2243 |
| Value | Count | Frequency (%) |
| 201500012 | 99 | |
| 201500067 | 99 | |
| 201500147 | 104 | |
| 201500216 | 93 | |
| 201500301 | 92 | |
| 201500374 | 91 | |
| 201500460 | 110 | |
| 201500525 | 110 | |
| 201500606 | 99 | |
| 201500675 | 101 |
| Value | Count | Frequency (%) |
| 201502425 | 60 | |
| 201502349 | 104 | |
| 201502276 | 80 | |
| 201502217 | 100 | |
| 201502142 | 105 | |
| 201502063 | 106 | |
| 201501987 | 132 | |
| 201501906 | 108 | |
| 201501842 | 111 | |
| 201501772 | 116 |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.198113208 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 8 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.176038238 |
|---|---|
| Coefficient of variation (CV) | 0.5183371983 |
| Kurtosis | -0.9660017255 |
| Mean | 4.198113208 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.1527614284 |
| Sum | 14240 |
| Variance | 4.735142414 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 559 | |
| 1 | 482 | |
| 6 | 459 | |
| 2 | 437 | |
| 5 | 434 | |
| 3 | 424 | |
| 7 | 387 | |
| 8 | 147 | 4.3% |
| 9 | 63 | 1.9% |
| Value | Count | Frequency (%) |
| 1 | 482 | |
| 2 | 437 | |
| 3 | 424 | |
| 4 | 559 | |
| 5 | 434 | |
| 6 | 459 | |
| 7 | 387 | |
| 8 | 147 | 4.3% |
| 9 | 63 | 1.9% |
| Value | Count | Frequency (%) |
| 9 | 63 | 1.9% |
| 8 | 147 | 4.3% |
| 7 | 387 | |
| 6 | 459 | |
| 5 | 434 | |
| 4 | 559 | |
| 3 | 424 | |
| 2 | 437 | |
| 1 | 482 |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| 1 | |
|---|---|
| 2 | |
| 3 | |
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 1139 | 33.6% |
| 2 | 1104 | 32.5% |
| 3 | 859 | 25.3% |
| 0 | 290 | 8.5% |
Words
| Value | Count | Frequency (%) |
| 1 | 1139 | 33.6% |
| 2 | 1104 | 32.5% |
| 3 | 859 | 25.3% |
| 0 | 290 | 8.5% |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.521521226 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 1318 |
| Zeros (%) | 38.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 6 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.906578729 |
|---|---|
| Coefficient of variation (CV) | 1.253074026 |
| Kurtosis | 2.202313762 |
| Mean | 1.521521226 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.613558561 |
| Sum | 5161 |
| Variance | 3.635042451 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1318 | |
| 1 | 848 | |
| 2 | 569 | |
| 3 | 178 | 5.2% |
| 5 | 149 | 4.4% |
| 4 | 128 | 3.8% |
| 6 | 116 | 3.4% |
| 8 | 86 | 2.5% |
| Value | Count | Frequency (%) |
| 0 | 1318 | |
| 1 | 848 | |
| 2 | 569 | |
| 3 | 178 | 5.2% |
| 4 | 128 | 3.8% |
| 5 | 149 | 4.4% |
| 6 | 116 | 3.4% |
| 8 | 86 | 2.5% |
| Value | Count | Frequency (%) |
| 8 | 86 | 2.5% |
| 6 | 116 | 3.4% |
| 5 | 149 | 4.4% |
| 4 | 128 | 3.8% |
| 3 | 178 | 5.2% |
| 2 | 569 | |
| 1 | 848 | |
| 0 | 1318 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| L |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | L |
|---|---|
| 2nd row | L |
| 3rd row | L |
| 4th row | L |
| 5th row | L |
Common Values
| Value | Count | Frequency (%) |
| L | 3392 | 100.0% |
Words
| Value | Count | Frequency (%) |
| l | 3392 | 100.0% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| 477132 |
|---|
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 477132 |
|---|---|
| 2nd row | 477132 |
| 3rd row | 477132 |
| 4th row | 477132 |
| 5th row | 477132 |
Common Values
| Value | Count | Frequency (%) |
| 477132 | 3392 | 100.0% |
Words
| Value | Count | Frequency (%) |
| 477132 | 3392 | 100.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| R | |
|---|---|
| L |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | R |
|---|---|
| 2nd row | R |
| 3rd row | R |
| 4th row | R |
| 5th row | R |
Common Values
| Value | Count | Frequency (%) |
| R | 2535 | 74.7% |
| L | 857 | 25.3% |
Words
| Value | Count | Frequency (%) |
| r | 2535 | 74.7% |
| l | 857 | 25.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.8 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 1745 | 51.4% |
| False | 1647 | 48.6% |
| Distinct | 33 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| Minimum | 2015-04-06 00:00:00 |
|---|---|
| Maximum | 2015-10-04 00:00:00 |
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| B | |
|---|---|
| F | |
| C | |
| S | |
| X | |
| Other values (9) |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.029775943 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | F |
|---|---|
| 2nd row | C |
| 3rd row | H |
| 4th row | B |
| 5th row | C |
Common Values
| Value | Count | Frequency (%) |
| B | 981 | 28.9% |
| F | 624 | 18.4% |
| C | 558 | 16.5% |
| S | 466 | 13.7% |
| X | 369 | 10.9% |
| D | 128 | 3.8% |
| *B | 101 | 3.0% |
| W | 73 | 2.2% |
| E | 45 | 1.3% |
| T | 28 | 0.8% |
| Other values (4) | 19 | 0.6% |
Words
| Value | Count | Frequency (%) |
| b | 1082 | 31.9% |
| f | 624 | 18.4% |
| c | 558 | 16.5% |
| s | 466 | 13.7% |
| x | 369 | 10.9% |
| d | 128 | 3.8% |
| w | 73 | 2.2% |
| e | 45 | 1.3% |
| t | 28 | 0.8% |
| l | 11 | 0.3% |
| Other values (3) | 8 | 0.2% |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| S | |
|---|---|
| B | |
| X |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S |
|---|---|
| 2nd row | S |
| 3rd row | B |
| 4th row | B |
| 5th row | S |
Common Values
| Value | Count | Frequency (%) |
| S | 1761 | 51.9% |
| B | 1089 | 32.1% |
| X | 542 | 16.0% |
Words
| Value | Count | Frequency (%) |
| s | 1761 | 51.9% |
| b | 1089 | 32.1% |
| x | 542 | 16.0% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 6 |
| Missing (%) | 0.2% |
| Memory size | 53.0 KiB |
| FF | |
|---|---|
| SL | |
| CU | |
| FT | 107 |
| CH | 8 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | FF |
|---|---|
| 2nd row | FF |
| 3rd row | FF |
| 4th row | FF |
| 5th row | FF |
Common Values
| Value | Count | Frequency (%) |
| FF | 1722 | 50.8% |
| SL | 932 | 27.5% |
| CU | 616 | 18.2% |
| FT | 107 | 3.2% |
| CH | 8 | 0.2% |
| IN | 1 | < 0.1% |
| (Missing) | 6 | 0.2% |
Words
| Value | Count | Frequency (%) |
| ff | 1722 | 50.9% |
| sl | 932 | 27.5% |
| cu | 616 | 18.2% |
| ft | 107 | 3.2% |
| ch | 8 | 0.2% |
| in | 1 | < 0.1% |
| Distinct | 522 |
|---|---|
| Distinct (%) | 15.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 229.7777123 |
| Minimum | 3 |
|---|---|
| Maximum | 560 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.0 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 107 |
| median | 223 |
| Q3 | 345 |
| 95-th percentile | 464.45 |
| Maximum | 560 |
| Range | 557 |
| Interquartile range (IQR) | 238 |
Descriptive statistics
| Standard deviation | 140.8944469 |
|---|---|
| Coefficient of variation (CV) | 0.6131771683 |
| Kurtosis | -1.018578309 |
| Mean | 229.7777123 |
| Median Absolute Deviation (MAD) | 119 |
| Skewness | 0.1726221885 |
| Sum | 779406 |
| Variance | 19851.24518 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 17 | 0.5% |
| 4 | 15 | 0.4% |
| 5 | 14 | 0.4% |
| 50 | 14 | 0.4% |
| 11 | 14 | 0.4% |
| 51 | 13 | 0.4% |
| 297 | 13 | 0.4% |
| 218 | 13 | 0.4% |
| 166 | 12 | 0.4% |
| 165 | 12 | 0.4% |
| Other values (512) | 3255 |
| Value | Count | Frequency (%) |
| 3 | 17 | |
| 4 | 15 | |
| 5 | 14 | |
| 6 | 9 | |
| 7 | 9 | |
| 8 | 6 | 0.2% |
| 9 | 9 | |
| 10 | 10 | |
| 11 | 14 | |
| 12 | 11 |
| Value | Count | Frequency (%) |
| 560 | 1 | |
| 559 | 2 | |
| 558 | 2 | |
| 557 | 1 | |
| 556 | 1 | |
| 554 | 1 | |
| 553 | 1 | |
| 550 | 1 | |
| 549 | 2 | |
| 548 | 2 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8166273585 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 1862 |
| Zeros (%) | 54.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.154778287 |
|---|---|
| Coefficient of variation (CV) | 1.414082292 |
| Kurtosis | 2.055413806 |
| Mean | 0.8166273585 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.574418781 |
| Sum | 2770 |
| Variance | 1.333512892 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1862 | |
| 1 | 816 | |
| 2 | 382 | 11.3% |
| 3 | 185 | 5.5% |
| 4 | 100 | 2.9% |
| 5 | 47 | 1.4% |
| Value | Count | Frequency (%) |
| 0 | 1862 | |
| 1 | 816 | |
| 2 | 382 | 11.3% |
| 3 | 185 | 5.5% |
| 4 | 100 | 2.9% |
| 5 | 47 | 1.4% |
| Value | Count | Frequency (%) |
| 5 | 47 | 1.4% |
| 4 | 100 | 2.9% |
| 3 | 185 | 5.5% |
| 2 | 382 | 11.3% |
| 1 | 816 | |
| 0 | 1862 |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| 0.0 | |
|---|---|
| 1.0 | |
| 2.0 | |
| 3.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 1664 | 49.1% |
| 1.0 | 1019 | 30.0% |
| 2.0 | 505 | 14.9% |
| 3.0 | 204 | 6.0% |
Words
| Value | Count | Frequency (%) |
| 0.0 | 1664 | 49.1% |
| 1.0 | 1019 | 30.0% |
| 2.0 | 505 | 14.9% |
| 3.0 | 204 | 6.0% |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| 0.0 | |
|---|---|
| 2.0 | |
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 2.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 1286 | 37.9% |
| 2.0 | 1061 | 31.3% |
| 1.0 | 1045 | 30.8% |
Words
| Value | Count | Frequency (%) |
| 0.0 | 1286 | 37.9% |
| 2.0 | 1061 | 31.3% |
| 1.0 | 1045 | 30.8% |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| 0.0 | |
|---|---|
| 1.0 | |
| 2.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 1193 | 35.2% |
| 1.0 | 1123 | 33.1% |
| 2.0 | 1076 | 31.7% |
Words
| Value | Count | Frequency (%) |
| 0.0 | 1193 | 35.2% |
| 1.0 | 1123 | 33.1% |
| 2.0 | 1076 | 31.7% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.840801887 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.681164674 |
|---|---|
| Coefficient of variation (CV) | 0.5917922971 |
| Kurtosis | 0.5306867948 |
| Mean | 2.840801887 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.9040000346 |
| Sum | 9636 |
| Variance | 2.826314662 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 897 | |
| 2 | 775 | |
| 3 | 667 | |
| 4 | 494 | |
| 5 | 310 | 9.1% |
| 6 | 140 | 4.1% |
| 7 | 66 | 1.9% |
| 8 | 30 | 0.9% |
| 9 | 10 | 0.3% |
| 10 | 3 | 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| 0.0 | 2541 |
|---|---|
| 1.0 | 851 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 2541 | 74.9% |
| 1.0 | 851 | 25.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| 0.0 | 2958 |
|---|---|
| 1.0 | 434 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 2958 | 87.2% |
| 1.0 | 434 | 12.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.0 KiB |
| 0.0 | 3175 |
|---|---|
| 1.0 | 217 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 3175 | 93.6% |
| 1.0 | 217 | 6.4% |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7 |
| Missing (%) | 0.2% |
| Memory size | 53.0 KiB |
| 1.0 | 1722 |
|---|---|
| 2.0 | 932 |
| 3.0 | 616 |
| 4.0 | 107 |
| 5.0 | 8 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.0 | 1722 | 50.8% |
| 2.0 | 932 | 27.5% |
| 3.0 | 616 | 18.2% |
| 4.0 | 107 | 3.2% |
| 5.0 | 8 | 0.2% |
| (Missing) | 7 | 0.2% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
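The calculation can be sketched in plain Python as a rank transform followed by the covariance-over-standard-deviations formula. This is an illustrative sketch, not the report's own implementation: the helper names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical, the toy data is made up, and giving tied values their average rank is an assumption.

```python
from math import sqrt

def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks (assumption)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of standard deviations (the 1/n factors cancel)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)

def spearman_rho(x, y):
    """Spearman's rho: the same formula applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Hypothetical toy data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

On this toy data the ranks of `x` and `y` coincide, so ρ reaches its maximum while the linear coefficient stays below it, which is the behaviour the paragraph describes.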
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
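A minimal sketch of this formula follows, applied to the `pitch_num` and `b_count` values from the first ten rows of the First rows table in this report. The function name `pearson_r` is hypothetical, and ten rows are far too few to reproduce the report's own coefficients; the point is only to show the arithmetic and the invariance under shifting and positive rescaling.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations.

    The 1/n factors in the covariance and the standard deviations cancel,
    so plain sums of products and squares are enough.
    """
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)

# pitch_num and b_count from the first ten rows of the report's sample.
pitch_num = [1, 2, 3, 1, 2, 3, 4, 5, 1, 2]
b_count = [0, 0, 0, 0, 1, 1, 1, 2, 0, 1]

r = pearson_r(pitch_num, b_count)

# Shifting and rescaling each variable separately (positive factors) leaves r unchanged.
r_rescaled = pearson_r([10 * v + 3 for v in pitch_num],
                       [0.5 * v - 7 for v in b_count])
print(r, r_rescaled)
```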
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
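The pair-counting definition can be sketched directly. The version below is the simple form described in the text (often called tau-a), in which tied pairs count as neither concordant nor discordant. Whether the report's tooling applies a tie correction such as tau-b is not stated here, so treat this as an illustration on hypothetical toy data; the function name `kendall_tau_a` is likewise hypothetical.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both variables move in the same direction
        elif direction < 0:
            discordant += 1      # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it counts toward neither total.
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Hypothetical toy data: two adjacent swaps in an otherwise increasing sequence.
x = [1, 2, 3, 4, 5]
y = [1, 3, 2, 5, 4]

tau = kendall_tau_a(x, y)
print(tau)
```

With five observations there are ten pairs; eight are concordant and two are discordant, giving τ = 0.6.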
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation for Phik is available.
First rows
| | ab_id | batter_id | event | g_id | inning | o | p_score | p_throws | pitcher_id | stand | top | date | code | type | pitch_type | event_num | b_score | b_count | s_count | outs | pitch_num | on_1b | on_2b | on_3b | pcodes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2015000768 | 571976 | Hit By Pitch | 201500012 | 1 | 0 | 0 | L | 477132 | R | True | 2015-04-06 | F | S | FF | 3 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| 1 | 2015000768 | 571976 | Hit By Pitch | 201500012 | 1 | 0 | 0 | L | 477132 | R | True | 2015-04-06 | C | S | FF | 4 | 0.0 | 0.0 | 1.0 | 0.0 | 2.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| 2 | 2015000768 | 571976 | Hit By Pitch | 201500012 | 1 | 0 | 0 | L | 477132 | R | True | 2015-04-06 | H | B | FF | 5 | 0.0 | 0.0 | 2.0 | 0.0 | 3.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| 3 | 2015000769 | 519083 | Strikeout | 201500012 | 1 | 1 | 0 | L | 477132 | R | True | 2015-04-06 | B | B | FF | 8 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 1.0 |
| 4 | 2015000769 | 519083 | Strikeout | 201500012 | 1 | 1 | 0 | L | 477132 | R | True | 2015-04-06 | C | S | FF | 9 | 0.0 | 1.0 | 0.0 | 0.0 | 2.0 | 1.0 | 0.0 | 0.0 | 1.0 |
| 5 | 2015000769 | 519083 | Strikeout | 201500012 | 1 | 1 | 0 | L | 477132 | R | True | 2015-04-06 | C | S | CU | 10 | 0.0 | 1.0 | 1.0 | 0.0 | 3.0 | 1.0 | 0.0 | 0.0 | 3.0 |
| 6 | 2015000769 | 519083 | Strikeout | 201500012 | 1 | 1 | 0 | L | 477132 | R | True | 2015-04-06 | B | B | FF | 11 | 0.0 | 1.0 | 2.0 | 0.0 | 4.0 | 1.0 | 0.0 | 0.0 | 1.0 |
| 7 | 2015000769 | 519083 | Strikeout | 201500012 | 1 | 1 | 0 | L | 477132 | R | True | 2015-04-06 | S | S | CU | 12 | 0.0 | 2.0 | 2.0 | 0.0 | 5.0 | 1.0 | 0.0 | 0.0 | 3.0 |
| 8 | 2015000770 | 461314 | Single | 201500012 | 1 | 1 | 0 | L | 477132 | R | True | 2015-04-06 | B | B | FF | 16 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 1.0 |
| 9 | 2015000770 | 461314 | Single | 201500012 | 1 | 1 | 0 | L | 477132 | R | True | 2015-04-06 | S | S | SL | 17 | 0.0 | 1.0 | 0.0 | 1.0 | 2.0 | 1.0 | 0.0 | 0.0 | 2.0 |
Last rows
| | ab_id | batter_id | event | g_id | inning | o | p_score | p_throws | pitcher_id | stand | top | date | code | type | pitch_type | event_num | b_score | b_count | s_count | outs | pitch_num | on_1b | on_2b | on_3b | pcodes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3382 | 2015183799 | 500208 | Strikeout | 201502425 | 4 | 2 | 2 | L | 477132 | R | True | 2015-10-04 | F | S | FF | 198 | 0.0 | 1.0 | 2.0 | 1.0 | 6.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| 3383 | 2015183799 | 500208 | Strikeout | 201502425 | 4 | 2 | 2 | L | 477132 | R | True | 2015-10-04 | F | S | CU | 199 | 0.0 | 1.0 | 2.0 | 1.0 | 7.0 | 0.0 | 0.0 | 0.0 | 3.0 |
| 3384 | 2015183799 | 500208 | Strikeout | 201502425 | 4 | 2 | 2 | L | 477132 | R | True | 2015-10-04 | F | S | SL | 200 | 0.0 | 1.0 | 2.0 | 1.0 | 8.0 | 0.0 | 0.0 | 0.0 | 2.0 |
| 3385 | 2015183799 | 500208 | Strikeout | 201502425 | 4 | 2 | 2 | L | 477132 | R | True | 2015-10-04 | B | B | SL | 201 | 0.0 | 1.0 | 2.0 | 1.0 | 9.0 | 0.0 | 0.0 | 0.0 | 2.0 |
| 3386 | 2015183799 | 500208 | Strikeout | 201502425 | 4 | 2 | 2 | L | 477132 | R | True | 2015-10-04 | S | S | SL | 202 | 0.0 | 2.0 | 2.0 | 1.0 | 10.0 | 0.0 | 0.0 | 0.0 | 2.0 |
| 3387 | 2015183800 | 576397 | Single | 201502425 | 4 | 2 | 2 | L | 477132 | R | True | 2015-10-04 | B | B | CU | 206 | 0.0 | 0.0 | 0.0 | 2.0 | 1.0 | 0.0 | 0.0 | 0.0 | 3.0 |
| 3388 | 2015183800 | 576397 | Single | 201502425 | 4 | 2 | 2 | L | 477132 | R | True | 2015-10-04 | T | S | SL | 207 | 0.0 | 1.0 | 0.0 | 2.0 | 2.0 | 0.0 | 0.0 | 0.0 | 2.0 |
| 3389 | 2015183800 | 576397 | Single | 201502425 | 4 | 2 | 2 | L | 477132 | R | True | 2015-10-04 | B | B | FF | 208 | 0.0 | 1.0 | 1.0 | 2.0 | 3.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| 3390 | 2015183800 | 576397 | Single | 201502425 | 4 | 2 | 2 | L | 477132 | R | True | 2015-10-04 | B | B | FF | 209 | 0.0 | 2.0 | 1.0 | 2.0 | 4.0 | 0.0 | 0.0 | 0.0 | 1.0 |
| 3391 | 2015183800 | 576397 | Single | 201502425 | 4 | 2 | 2 | L | 477132 | R | True | 2015-10-04 | D | X | FF | 210 | 0.0 | 3.0 | 1.0 | 2.0 | 5.0 | 0.0 | 0.0 | 0.0 | 1.0 |